Design and Development of a Text-to-Speech Synthesizer for Afan Oromo
Abstract
Speech is one of the natural ways of communication between humans, and it was later extended as a means of human–computer interaction. Speech synthesis helps visually impaired people read electronic texts and is used in information retrieval and language education. This paper proposes the development of a text-to-speech (TTS) synthesizer for Afan Oromo (the Oromo language) using unit selection speech synthesis approaches. Although several works have been conducted in the area of speech synthesis for technologically favored languages over many years, every language has its own unique features. Systems developed for one language therefore cannot be used for another, because the structures of one language are not representative of others. It is clear that each text-to-speech program is based on a system corresponding to the phonetic rules of a certain language. Existing work was also reviewed in this study: the prototype results are promising, but their performance still needs a lot of improvement in terms of intelligibility and naturalness, through novel approaches and a quality corpus. This research was therefore initiated to explore the possibility of developing and improving the synthesizer. In this study, a corpus was collected from genuine sources, and datasets of both text and audio were prepared in collaboration with experts. The system was tested by proper users using the Mean Opinion Score (MOS). It obtained 4.44 (very good) out of 5, which is encouraging and indicates better naturalness than existing TTS systems. As scored, however, the system still needs further work. The main challenge was dialects: preparing a dialect-balanced corpus is very difficult. Further enhancement work is expected to bring the system to a reasonable level.
Similar resources
A rule-based Afan Oromo Grammar Checker
Natural language processing (NLP) is a subfield of computer science with strong connections to artificial intelligence. One area of NLP is concerned with creating proofing systems, such as grammar checkers. A grammar checker determines the syntactic correctness of a sentence and is mostly used in word processors and compilers. For languages, such as Afan Oromo, advanced tools have been lackin...
Text-to-Audiovisual Speech Synthesizer
This paper describes a text-to-audiovisual speech synthesizer system that incorporates head and eye movements. The face is modeled using a set of images of a human subject. Visemes, which are a set of lip images of the phonemes, are extracted from a recorded video. A smooth transition between visemes is achieved by morphing along the correspondence between the visemes obtained by optical flows. ...
A text-to-audiovisual-speech synthesizer for French
An audiovisual speech synthesizer from unlimited French text is presented here. It uses a 3-D parametric model of the face. The facial model is controlled by eight parameters. Target values have been assigned to the parameters, for each French viseme, based upon measurements made on a human speaker. Parameter trajectories are modeled by means of dominance functions associated with each paramete...
Czech Text-to-Sign Speech Synthesizer
Recent research progress in developing the Czech – Sign Speech synthesizer is presented. The current goal is to improve the system for automatic synthesis to produce accurate synthesis of the Sign Speech. The synthesis system converts written text to an animation of an artificial human model. This includes translation of text to sign phrases and its conversion to the animation of an avatar. The...
Journal
Journal title: SN Computer Science
Year: 2022
ISSN: 2661-8907, 2662-995X
DOI: https://doi.org/10.1007/s42979-022-01306-7